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Abstract 

We characterize the conditions for the conservation of the energy and of the com¬ 
ponents of the momentum maps of lifted actions, and of their ‘gauge-like’ gener¬ 
alizations, in time-independent nonholonomic mechanical systems with affine con¬ 
straints. These conditions involve geometrical and mechanical properties of the 
system, and are codified in the so-called reaction-annihilator distribution. 
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1. Introduction 

In this paper we study the conservation of the energy and of the components of the 
momentum maps of lifted actions—and of their ‘gauge’ generalizations introduced in 
|3]—in time-independent mechanical systems with ideal nonholonomic constraints that 
are affine functions of the velocities. The conservation of both types of functions is 
affected by the reaction forces exerted by the nonholonomic constraint, but there is a 
difference between them. The conservation of energy depends on certain properties of 
the nonhomogeneous term of the constraint distribution, and thus differs from the case of 
nonholonomic systems with constraints that are linear functions of the velocities. Instead, 
the conservation of momenta and gauge momenta is essentially the same as that in the 
case of linear constraints, which has been extensively studied is 122 ns ns* Therefore, 
we will mainly focus on the conservation of energy. (If the Lagrangian is not quadratic 
kinetic energy minus positional potential energy, then the function we call energy is, 
properly, the Jacobi integral). 

It is well known that the energy is conserved in nonholonomic mechanical systems with 
constraints that are linear in the velocities mm, and that it is not always conserved 
if the constraints are affine, or more generally nonlinear, in the velocities. More than 
specifically in the case of affine constraints, the issue of energy conservation has received 
extensive consideration in the case of general nonlinear constraints 0 HU [301 H3 US 
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mnm- The equation of balance of the energy shows that the energy is conserved 
if and only if the reaction forces exerted by the constraint do not do any work on the 
constrained motions. The quoted references deduce from this some sufficient conditions 
for the conservation of energy, such as the tangency of the Liouville vector field to the 
constraint manifold (as e.g. in |23 03 ]) or the fact that the constraint is a homogeneous 
function of the velocities (as e.g. in mw- However, when particularized to the case of 
affine constraints, as e.g. in [251123], all these sufficient conditions reduce to the linearity of 
the constraints. It seems, therefore, that conservation of energy for nonholonomic systems 
with affine constraints is presently not understood. The main purpose of this paper is to 
remedy this lacune, by identifying the properties of a nonholonomic mechanical system 
with affine constraints that determine the conservation of its energy. 

At the basis of our approach is the fact that, under the hypothesis of ideality of the 
constraints (namely, d’Alembert principle), the reaction force that the constraint exerts 
on the system is a known function of the kinematic state of the system. This function 
depends on properties of the system that are both of geometric nature (the nonholonomic 
constraint) and of mechanical nature (the mass distribution and the active forced). The 
inspection of this function reveals that, for a given system, the set of all reaction forces 
exerted by the constraints on the constrained motions might be (and typically is) smaller 
than the set of all reaction forces that satisfy the condition of ideality. The reason is that 
for a given system the active forces that act on the system are fixed, while the notion 
of ideality makes reference to all possible active forces that might possibly act on the 
system (see m for a discussion of this fact). Consequently, any vector that annihilates 
the linear part of the constraint may be an ideal reaction force, but for a given system, 
the class of reaction forces actually exerted by the constraint may be a subset of this 
annihilator. 

Therefore, the properties of a nonholonomic system that are influenced by the reaction 
forces exerted on constrained motions may depend, in a complicate way, on the geometric 
and mechanical properties of the system, including the active forces. These properties can 
be codified by a distribution on the configuration manifold, which is called the reaction- 
annihilator distribution. This distribution was introduced in |18| in connection with the 
conservation of the momentum map of lifted actions in nonholonomic systems with linear 
constraints, and was further used in 

We will show that a necessary and sufficient condition for energy conservation in 
nonholonomic mechanical systems with affine constraints is that the nonhomogeneous 
term of the constraint is a section of the reaction-annihilator distribution. This clarifies 
in a quantitative, computable way why energy conservation is not a property of purely 
geometric type and, in particular, how it depends on the active forces that act on the 
system: changing just the (conservative) active forces that act on a given system (same 
constraints, same kinetic energy) may destroy, or restore, the conservation of energy. We 
will illustrate these behaviours on some examples. 

We recall the basic facts about nonholonomic mechanical systems with affine con¬ 
straints, and introduce the reaction-annihilator distribution for these systems, in Sec¬ 
tion 2. Energy conservation is studied in Section 3 and the conservation of momenta and 
gauge momenta of lifted actions is concisely studied in Section 4. A short Conclusion 
follows, where we stress the importance of exploiting the knowledge of the reaction forces 
in the study of nonholonomic systems. In the Appendix we derive the expression of the 
reaction force as function of the kinematic state. 

1 By ‘active forces’ we mean the forces that act on the system and are not reaction forces; in part of 
the literature they are called ‘external forces’. 
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Throughout the paper all manifolds and maps are smooth and all vector fields are 
assumed to be complete. For simplicity we restrict our consideration to time-independent 
systems. For introductions to nonholonomic mechanics see e.g. [j2BJ [27J [25] [T2J [25] [7] Til . 

Lastly, we mention that in certain nonholonomic mechanical systems with affine con¬ 
straints, even if the energy is not conserved, there may exist a modification of it (which 
may be interpreted as the energy of the system in a moving reference frame and has 
therefore been called a ‘moving energy’) that is conserved [2D'b 

2. Affine constraints and the reaction-annihilator distribution 

2.1. Nonholonomic systems with ideal affine constraints. Since affine constraints 
appear typically in problems of rigid bodies that roll on moving surfaces, it is appropriate 
to work on a phase space that is a manifold. However, for simplicity we will resort wher¬ 
ever possible to a coordinate description!! Moreover, because of the possible presence 
of moving holonomic constraints in this type of systems, it is natural to allow for the 
presence of gyrostatic terms in the Lagrangian, that may come either from the use of 
non-inertial frames or from the use of moving coordinates. We assume however that the 
system is time-independent, as is it typically happens if the bodies and the surface have 
suitable symmetries and the latter moves at uniform speed. 

Our starting point is thus a Lagrangian system with n-dimensional configuration 
manifold Q and Lagrangian L : TQ —> R, that describes a mechanical system subject to 
ideal holonomic constraints. We assume that the Lagrangian has the mechanical form 

L = T-b-V o-k (1) 

where T is a positive definite quadratic form on TQ, 6 is a 1-form on Q regarded as a 
function on TQ , V a function on Q and 7r : TQ — > Q is the tangent bundle projection. 
Following e.g. E3 we write Lagrange equations as [L\ =0, where [L\ may be regarded 
as a 1-form on Q, whose coordinate expression is the well known 

We add now the nonholonomic constraint that, at each point q € Q, the velocities 
of the system belong to an affine subspace Mq of the tangent space TqQ. Specifically, 
we assume that there are a nonintegrable distribution T> on Q of constant rank r, with 
1 < r < n, and a vector field £ on Q such that, at each point q £ Q, 

■Mq = £(q) + 'Dq ■ 

Clearly, the vector field £ is defined up to a section of V. The affine distribution M with 
fibers Aiq may also be regarded as a submanifold M C TQ of dimension n + r, which 
is actually an affine subbundle of TQ of rank r, and is called the constraint manifold. 
The case of linear constraints is recovered when the vector field £ is a section of the 
distribution V , since then M. =V. 

We assume that the nonholonomic constraint is ‘ideal’, namely, that it satisfies 
d’Alembert principle. This means that, when the system is in a configuration q £ Q, the 
set of reaction forces that the nonholonomic constraint is capable of exerting coincides 
with the annihilator 'D'~ of T>q (see e.g. [126;, [25]). Under this hypothesis there is a unique 

function R^ ^ : M —>- T >°, namely a function that associates an ideal reaction force R^ y f 

2 In the sequel, symbols with a hat denote global objects and the same symbols without the hat their 
local, coordinate representatives. 
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to each constrained kinematic state Vq £ M , which has the property that the restriction 
to M of Lagrange equations with the reaction forces, 

\-L\Im = R-l,m ■> ( 2 ) 

defines a dynamical system on M (that is, a vector field on M). For completeness, we 
give a proof of this fact in the Appendix. 

Definition. Assume that L : TQ — » R is as in m and that M is an affine subbundle 
of TQ. The nonholonomic mechanical system with affine constraints (L, Q, M ) is the 
dynamical system defined by equation on M. 


2.2. Coordinate description. We consider now a system of local coordinates q on Q. 
with domain Q C R n , and lift them to bundle coordinates (q,q) £ Q x R” in TQ. We 
write the local representative of the Lagrangian L = T + b —Von&s 

L(q,q)=^q-A(q)q + b(q)-q-V{q) (3) 

with A(cf) an n x n symmetric nonsingular matrix and b(q) £ R™ 

The fibers of the local representative V of the distribution T> can be described as the 
kernel of a ^-dependent k x n matrix S(q) that has everywhere rank k, with k = n — r: 

Vq = {q £ T q Q = R" : S{q)q = 0} . 


The matrix S is not uniquely defined. However, if Si and S 2 are any two possible choices 
of it, then from ker S± = ker S 2 it follows that there exists a g-dependent k x k nonsingular 
matrix P(q) such that S 2 = PSi. 

Let now A4 be the local representative of Xi , M C Q x R™ that of M and f : Q —> R" 
that of £. Then q £ M q if and only if q = £(q) + u for some u £ ker S(q), that is, if and 
only if S(q)[q — ^(g)] = 0. Thus 

M = {(g, q) £ Q X R" : S(q)q + s(q) = 0} 


with 

s(q) = -S(q)aq) £ R" • 


Note that s is independent of the arbitrariness in the choice of the component of £ along 
V, that is of the component of £ along ker S. However, s depends on the choice of the 
matrix S: if Si and S 2 are relative to two matrices Si and S 2 = PSi, then S 2 = Ps 1 . 

In coordinates, the equations of motion m of the nonholonomic mechanical system 
(L, Q, M) are 


/ d dL 
\dt dq 


dL\ 

~dq) 


M 


Rl,m 


(4) 


where Al,m(9, q) is the local representative of R^ A y. As shown in the Appendix, Rl,m 
equals the restriction to M of the function 


where £ £ R” and a £ 


S T {SA- 1 S T )- 1 (SA~ 1 £-a) 

have componenttH 

d 2 L 
dqidq. 


-<U ^ ITT 


dL 

% 


dS ai . . 8s a 

<Ja = -X- qidj + -X—' 

dqj dq 0 


(5) 

( 6 ) 


3 We use everywhere the convention of summation over repeated indexes. 
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with i,j,h = 1 ,n and a = 1,..., k. Note that 

£ = a + / 3+ V' 

with ai = (l“ §T&r) 9j9h, A = (H - §*)qj and V' = g£. 

In the case of linear constraints, expression 0 or analogue expressions are given in 
13 US IZI; in the Appendix, besides considering the affine case, we complement these 
treatments with a global perspective. 

Remark. The restriction to M of the function (O is independent of the choice of S and 
s, that is, under the replacement of S, s by PS, Ps. (This change produces an extra term 
in a, which however vanishes onM). 

2.3. The reaction-annihilator distribution. While the condition of ideality assumes 
that, at each point q £ Q, the constraint can—a priori—exert all reaction forces that 
lie in 2?|, expression (0 shows that, ordinarily, only a subset of these possible reaction 
forces is actually exerted in the motions of the system. In fact, in coordinates, V°~ is the 
orthogonal complement to ker S(q), namely the range of S(q) T , and the map 

S T (SA~ 1 S T )~ 1 (SA~ 1 £ — er)| : M q -> remgeS{q) T 

might not be surjective. Specifically, the reaction forces that the constraint exerts, when 
the system (. L , Q , M) is in a configuration q G Q with any possible velocity v q G M. q , are 
the elements of the set 

^9 := U Al,m(®0) 

V£Mg 

and this set may be (and typically is) a proper subset of £>?. This point of view was 
taken in m and leads to the following 

Definition. The reaction-annihilator distribution TZ° of a nonholonomic mechanical sys¬ 
tem with affine constraints ( L , Q, M) is the (possibly non-smooth and of non-constant 
rank) distribution on Q whose fiber 1Z 1 - at q £ Q is the annihilator oflZ q . 

The interest of this distribution is not that much geometric, but mechanical: a vector 
field Z on Q is a section of 1Z° if and only if, in all constrained motions of the system, 
the reaction force does zero work on it: (. R^ j^(v q ),Z{q)) = 0 for all v € M. This is a 

system-dependent condition, which is weaker than being a section of T> because 

v q c \/qeQ. 

Expression 0 of the reaction forces shows that 1Z° is computable a priori, without 
knowing the motions of the system. Moreover, this expression shows how 7 Z° depends on 
the geometry of the nonholonomic constraints (through the matrix S and the vector s) 
and on the mass distribution of the system and on the active forces that act on it (through 
the Lagrangian)@ 

Examples of reaction-annihilator distributions 1Z°, that show that their fibers may 
actually be larger than those of T>, are given in mum for systems with linear constraints 
and in sections 3.2 and 4.3 below for the case of affine constraints. For a discussion of 
the relation between 1Z° and d’Alembert principle see [TO] . 

4 it° depends of course also on the holonomic constraint, through the dependence of L on (q. q). 
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3. Conservation of energy in systems with affine constraints 


3.1. Characterization of the conditions for energy conservation. We recall that 
the energy , or more exactly the Jacobi integral , of a Lagrangian L is the function 

E~ l (v) := (p L ,v)-L(v) 


where p £ is the momentum 1-form relative to L, namely fif = FL with F the fiber 
derivative (as defined, e.g., in [T]). In coordinates, Pl = ^ and, if L is as in ©, 
PL(q,q) = A(q)q-b(q). 

If L is as in © then El = T + V on. The function Ef can be properly interpreted 
as the mechanical energy of the system only if L = T—Von but, as is customary, we 
will call it energy in all cases. For a Lagrangian system, E^ is a first integral if and only 
if, as we do assume here, L is independent of time. 

Definition. The energy E^ ^ of the nonholonomic mechanical system with affine con¬ 
straints (L 7 Q,M) is the restriction of E^ to the constraint manifold M. 

Proposition 1. For a nonholonomic mechanical system with affine constraints (L, Q 7 M) 
with constraint distribution V + the energy Ef ^ is a first integral if and only if £ is 
a section ofTZ°. 

Proof The proof can be given in coordinates. By ©, along any curve t ( qt,qt ) G M, 

d . f d dL dL\ . , . . , . 

-j t E L , M (qt,qt) = qt ■ \ ~ ~dq) = 9t ' RL , M {9t,Qt) ■ V) 

If q £ A / l(q), namely q = u + f(q) with u G V qi then 

q) ■ q = RL,M(q , q) • £(<?) 


given that Rl,m is ideal and hence annihilates V q . It follows that El 7 m is a first integral if 
and only if RL,M(q, q) •£(?) = 0 for all q € Q, q G M q , that is f(q) G 1Z° for all q G Q. □ 

This shows that energy conservation is not a universal property of nonholonomic 
mechanical systems with affine constraints. In particular, as we have already stressed, it 
depends on the active forces that act on the system. 

In this respect we note that at each point q G Q, the union of the fibers at q of the 
distributions TZ° relative to all functions V : Q —> R. equals V q . (In fact, in expression 
©, £ = a + ft + V' and the matrix S T (SA~ X S T ) 1 S'A _1 is, at each point q , the 
A(< 7 ) -1 -orthogonal projector onto V°). Thus, it follows from Proposition 1 that, given 
T and M, E^ is a first integral in all nonholonomic mechanical systems of the class 
(L = T — V o 7 r, Q,M), with any V : Q —> M, if and only if the constraint is linear. 
This fact, which can be generalized to allow for the presence of gyrostatic terms in the 
Lagrangian, is a particular case of a result by |31j for general nonlinear constraints (with 
linearity replaced by homogeneity). 

Remark. The result in Proposition [T| does not depend on the choice of £, which is defined 
up to the addition of a section of V , because V C 1Z°. 
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3.2. Examples. 1. An affine nonholonomic particle. In order to illustrate the depen¬ 
dency of energy conservation on the active forces we consider an affine version of Pars’ 
nonholonomic particle [25;. This system has configuration manifold R 3 B q = ( x,y,z ), 
Lagrangian 

L(q,q) = ±\\q\\ 2 ~V( q ) 

with a potential energy V for which we will make different choices, and constraint 


z + xy — yx — c = 0 


with c a nonzero real number. Thus n = 3 and k = 1, £ = cd z and the distribution 
V is spanned by the two vector fields d x + yd z and xd x + yd y . V has rank 2 except 
where y = 0, so we disregard these points and restrict the configuration manifold to 
Q = R 3 \ {y = 0}. Note that not only £ is not a section of V but, after the restriction 
to Q , 

(L{q)iV q VqeQ. ( 8 ) 


The constraint manifold M is diffeomorphic to Q x R 2 and has global coordinates 
(x, y, z, x, y). We may take S(x, y, z) = (— y, x, 1) and s = —c. 

Due to its low dimensionality, this example offers little variety of behaviours. Specif¬ 
ically, the distribution 1Z° depends on the function V but, since its fibers contain those 
of V , that have dimension 2, there are only two possibilities: at a point q, either 
7L\ = T q Q = 

for all (jgR 3 and the second if R.L,M(q, q) / 0 for some q G 

At the same time, however, the low dimensionality makes all computations straight¬ 
forward, and it is simple to find potentials V that exemplify the different possibilities. 


or lZ q = Vq. The former possibility is realized if ?) = 0 


Rl,m = S T (SS T )~ 1 V' 


1 


M 


1 -T x 2 + y 2 


d © 

gives 

xy 2 

-xy 

-xy 

x 2 

-y 

X 


V' 


M 


(9) 


Thus, the reaction forces are independent of the q. This would make it straightforward 
to compute the fibers 1Z°, which are simply the orthogonal complements to in 

R 3 , but we need not doing it because we already know what these fibers are. 

Concerning the conservation of energy, there are two cases to consider: 

1. If V is such that Rl,m(m) = 0 at all points q G Q, then 1Z° = R 3 and energy is 
conserve d| From (0) one verifies that this situation is encountered, e.g., with V = 0 
and V = ^(x 2 + y 2 ). 

2. If V is such that Rl,m is not identically zero, then there is a point q G Q at which 
IZq = Vq. Thus, by ©, £(g) ^ IZq and so £ is not a section of 7 Z°. Hence energy is 
not conserved. An example is V = z. 

A richer variety of possibilities, including (a) conservation of energy even with nonzero 
reaction forces, and (b) violation of the conservation of energy even in the absence of 
active forces, can be easily constructed by considering four- or five-dimensional extensions 
of the nonholonomic particle, similar to that considered in m- However, cases (a) and 
(b) are met also in known mechanical systems. For instance, in the system formed by 

5 That energy is conserved if the reaction forces vanish identically is of course obvious, for a variety 
of reasons. For instance, the nonholonomic system is a subsystem of the unconstrained system with 
Lagrangian L on TQ. 
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a heavy sphere that rolls on a rotating horizontal plane, considered e.g. in 122 , the 
potential energy of the active forces is constant but the energy is not conserved [201 . An 
example of case (a) is the following one. 

2. A sphere rolling inside a rotating cylinder. As a second example we consider the 
system formed by a homogeneous sphere constrained to roll without sliding inside a 
cylinder that rotates with constant angular velocity about its figure axis. We assume 
that the sphere is acted upon by positional forces whose potential energy is a function of 
the position of the sphere’s center of mass. The case of a heavy sphere inside a vertical 
cylinder that is at rest is classical he mi- The case of rotating cylinder, but without 
active forces, is considered in [ 2 ]. 

Let a be the radius of the sphere and r + a the radius of the cylinder. The holonomic 
system we start from is formed by the sphere constrained to keep its center on a cylinder C 
of radius r. Fix an orthonormal frame E = {O; e x , e y , e z } with the origin O on the figure 
axis of the cylinder and e 2 aligned with it. Using cylindrical coordinates (z, 7 ) relative 
to E we identify C with 1x5' and the configuration manifold is Q = R x S 1 x SO(3) 9 
{z, 7 , 1Z ), where the matrix 1Z gives the sphere’s orientation. Up to an overall factor, the 
Lagrangian is then 

L = \ (U 2 7 2 +z 2 ) + ^IMI 2 - V(z,7) ( 10 ) 

where (up to the same factor) I is the moment of inertia of the sphere and V is the 
potential energy of the active forces, and w = (u> x ,uiy,u) z ) is the angular velocity of 
the sphere relative to E (that we think of as a function on TSO(3)). We add now the 
nonholonomic constraint that the sphere rolls without sliding on a cylinder, coaxial with 
C and of radius r + a, that rotates with constant angular velocity Qe z relative to E: 

r'y + au> z — (r + a)Cl = 0, i + a(u> x sin 7 — u y cosy) = 0 . ( 11 ) 

These equations can be solved for (z, 7 ) and the constraint manifold M is thus diffeo- 
morphic to R. x S 1 x TSO(3). 

As local coordinates on Q we use the cylindrical coordinates ( 2 , 7 ) of the center 
of mass of the sphere and three Euler angles {ip,ip,9) £ S 1 x S 1 x (0, 7 r) that fix the 
orientation of a body frame relative to E (we adopt the convention of [3] for the choice 
of these angles). The representative of L is 

L = i (r 2 y 2 + z 2 ) + - {9 1 + p 2 + ip 2 + 2ipip cos 9) - V(z, 7 ) 
and the constraint (HD becomes 

rj + a(<p + ip cos 9) — (r + a) fl = 0 , z + a [ip sin 9 cos(y — <p) + 9 sin (7 — <£>)] = 0 . 

In these coordinates the vector field £ becomes the constant vector field p = fl(<9 7 + d v ), 
or p = (0, fl, f l, 0,0), and a possible choice of S and s is 

g _ A) r a a cos 9 0 

^1 0 0 asin 0 cos (7 — tp) asin (7 — ip) 

As local coordinates on M we may use (z, 7 , ip, ip, 9, ip, ip, 9). From (0, the reaction force 
is then 

Rl,M = J + a 2 (/ 1 K > ~K ’ a f C0S ^ _ v) sin 9 + cos 6 > a f sin (7 - V 3 )) 


s = (0, —(r + a) fl). 
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with f = + cos 9 — II) (9 cos (7 — <p) — ip sin (7 — ip) sin 0) + V' z . Therefore, 

Rl,m ■ £ = —ok; . 

This shows that, when Q 0 and hence £ ^ 0 and the constraint is affine, the energy 
is conserved if and only V depends on z alone. This includes the case of a heavy sphere 
that rolls inside a rotating vertical cylinder, for which V = gz. 

4. Conservation of momenta and gauge momenta of lifted actions 

4.1. Conservation of momenta. We consider now a second problem in which the 
reaction forces of a nonholonomic mechanical system with affine constraints play a role: 
the conservation of the momentum map of a lifted action that leaves the Lagrangian L 
invariant, and of its ‘gauge’ generalization. 

This topic has been widely studied in the case of nonholonomic mechanical systems 
with linear constraints. For such systems, the momentum map is in general not con¬ 
served, but in certain cases some of its components are conserved. In early studies, it 
was pointed out that a sufficient condition for the conservation of a component of the 
momentum map is that its infinitesimal generator is ‘horizontal’, that is, a section of the 
constraint distribution (see e.g. EE HQ g). It was later proved that the components of 
the momentum map that are conserved are exactly those whose infinitesimal generators 
are sections of 7 Z° [TS]. Our first goal here is to show that this result holds in the case of 
affine constraints, too. In fact, the affine part of the constraint plays no role in it. 

Consider an action T : G xQ — > Q of a Lie group G on the configuration manifold Q. 
For each q £ Q we write as usual 4 , g (q) for 4(g, q). The tangent lift 4 T< 3 : GxTQ —»• TQ 
of the action 4 is the action of G on TQ given by 

^in) = T^ g -n 

(in coordinates, 4^*2 (q,q) = (A> g (q), ^' g {q)q) with 4 9 = F^ 2 -). We denote by Y v := 
^r'F e x P (tr/)|t=o the infinitesimal generator relative to an element 77 £ fl, the Lie algebra 
of G. Correspondingly, the 77 -component of the momentum map of 'F 7 ^ is the function 
J ?; : TQ —> M defined as 

Jv(Vq) ■■= {pi(Vq),Y v (q)) 

(in coordinates, • F g ). 

The tangent lift of a vector field Z on Q is the vector field Z T Q on TQ whose integral 
curves t i-» v(t) are velocities of integral curves t >->• q(t) of Z, that is v(t) = Z{q{t)) G 
T m Q (in coordinates, Z T Q = Zid qi Clearly, YfQ = ^^ X p ( ^)|t=o- 

Consider now a nonholonomic mechanical system with affine constraints (L, Q , M ) 
and assume that L is invariant under namely L o 4^ = L for all g € G. Then, 

we say that the function J v is the momentum of (L,Q,M) generated by Y v . 

Proposition 2. Assume that L is invariant under Then a momentum is a first 

integral of (L, Q, M) if and only if its generator is a section oflZ°. 

Proof. We may work in coordinates. A computation gives -^j(J v \m) = | + 

Rl,m ■ Y v | The invariance of L implies Y^(L) = 0. Thus, J v \m is a first integral if 
and only if, at each q £ Q, Y v annihilates all reaction forces Rl,m(<1,<1) with q £ At g , 
that is, Y v (q) £lZ°. □ 
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4.2. Conservation of gauge momenta. It was an original idea of IT] that, for nonholo- 
nomic mechanical systems with linear constraints whose Lagrangian is invariant under a 
lifted action, certain conserved quantities that are not components of the momentum map 
may be viewed as linked to the group by a gauge-like mechanism. This situation extends 
to nonholonomic mechanical systems with affine constraints. For a general study of this 
topic in systems with linear constraints, and more information on the topic, including 
e.g. its relation to the so called ‘momentum equation’, see [IMS US El- 

Following the terminology of |16] we say that a vector field Y on Q is a gauge symmetry 
of (L,Q,M) relative to the action >1/ if it is everywhere tangent to the orbits of ’3/ and, 
moreover, 

Y T °(L) I# = 0. 

This invariance condition of L is independent of the invariance of L under even 

though it implies that V is ^-invariant. The gauge momentum generated by a gauge 
symmetry Y is the function 

The gauge symmetry that generates a given gauge momentum needs not be unique. 

Proposition 3. A gauge momentum is a first integral of ( L,Q,M ) if and only if it is 
generated by a gauge symmetry which is a section oflZ°. 

The proof goes just as that of Proposition [2] 

The need of considering gauge momenta generated by gauge symmetries that are 
sections of 1Z°. not only those generated by sections of V, is demonstrated by the example 
in the following section. This is the heavy sphere that rolls inside a rotating vertical 
cylinder. This system has as a first integral that depends smoothly on the angular 
velocity of the cylinder and can be interpreted as a gauge momentum. Interestingly, 
when the cylinder is at rest this gauge momentum is generated by a gauge symmetry 
that is a section of T>\ but as soon as the cylinder rotates, the generating gauge symmetry 
leaves V and becomes a section of TZ°. 

4.3. Example. Consider a heavy sphere that rolls without sliding inside a cylinder that 
rotates uniformly about its vertical axis, namely, the system of section 3.2 with potential 
energy V = gz. The Lagrangian L and the configuration manifold Q are independent of 
Q, while the constraint manifold M depends on through the vector field f. We thus 
denote it by Mq. 

It is classically known USE] that, when = 0, the system (L, Q 1 M 0 ) has the two 
first integrals F 0 := F\^ q and K 0 := K |^ , where 

F = Iui z — arj , K = a(ui x cos 7 + ui v sin 7 ) — 2:7 . (12) 

These two first integrals have been linked in [3] to the action T of the group G = 
S 1 x SO(3) 9 (C,S) on Q 9 (z,j,TZ) given by 4 , c,s( z i 7, F) = ( 2,7 + S^TZS), where 

is the matrix of the rotation by ( about the third axis. This action emerges naturally 
in this problem because it leaves the Lagrangian uni and the constraint m invariant. 
With reference to this action, Fq is a momentum generated by an infinitesimal generator 
that is a section of V and I\q is a gauge momentum generated by a gauge symmetry 
which is a section of T>. see [4]. 

When ft ^ 0 , the system (L , Q , Mq) has the two first integrals 

:= ^1 Mn > : = R \Mn 
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which appear in [9]. (Reference [9] considers only the case V = 0, but these two first 
integrals exist for any V = V(z), see the remark below). We now show that, with 
reference to the considered action SR, when Q ^ 0 the function Fq is a momentum 
generated by an infinitesimal generator which is a section of T> and the function Kq is 
a gauge momentum generated by a gauge symmetry which is a section of TZ°, not of V. 
Therefore, at variance from the case il = 0, in order to link the first integral Kq to the 
group action using the gauge mechanism, when Q ^ 0 it is necessary to take into account 
the role of the reaction forces. 

To prove these assertions we pass to the local coordinates (z,'y,ip,ip,0) on Q. The 
representatives F of F and I\ of K are obtained from m with Wj = 9 cos p+ip sin p sin 9 , 
u>y = 8 sirup — ip cosp sin 9, uj z = p + ip cos 9. It follows from the analysis of section 3.2 
that, in these coordinates, 

T> = span R {a<9 7 — rd v ,dg — a sin (7 — p) d z , d^, — a cos (7 — p) sin 9 d z — cos 0 <9 V } 

1Z° = span R {d 7 , d^ , dg — a sin (7 — p) d z , d^ — a cos (7 — p) sin 9 d z — cos 9d v } . 

The tangent spaces to the group orbits are spanned by d 1 and by three infinitesimal 
generators of the SO(3)-action, e.g. the generators 

T] x = sin p(d^ — cos 9 d v ) + cos <p sin 9 dg , 

r] y = cos ip(d^j — cos 9 d v ) — sin 9 sin p dg , 

V* = df 

of the components w x ,Wy,a ; 2 of the SO(3)-momentum map. The vector fields 

Yf -=Vz - 9^, Y k := a (rj x cos 7 - r] y sin 7 ) - 4t9 7 

r 1 sm 9 r z 

are tangent to the group orbits. Yp is an infinitesimal generator of the S 1 x SO(3)-action 
and is a section of V. As such it generates a conserved momentum, that equals Fq,. Yk 
is instead a section of 1Z° and is (the local representative of) a gauge symmetry because 

Y tq (L ) = (z + auj x sin 7 — auj y cos 7 ) 7 

vanishes on Mq, see (HD. Thus, Yk generates a conserved gauge momentum, which 
equals Kq. 

If we use coordinates (z, 7 , p, ip , 9 , ip, ip , 9) on Mq, then 

Fq = (I + a 2 )oj z — a(a + r)I2 , A'n = aw x COS 7 + aw„ sin 7 H— zu z -——SI z . 

TV 

It remains to prove that Kq is not generated by any gauge symmetry which is a 
section of V. To this end, we make the following observations. We call generator of 
a gauge momentum any vector field Z —not necessarily a gauge symmetry—such that 
J = (p£, Z)\ A ~ r Then, a gauge momentum has at most one generator that is a section 
of V. We may prove this in coordinates. If W and Z are the representatives of two 
generators of a gauge momentum J then (W — Z) ■ (Aq — 6 )|m = 0. Equivalently, at each 
point q , (W — Z) ■ (Au + A£ — b) = 0 for all u £ V q . Hence Z and W satisfy the two 
conditions 

(W - Z) ■ Au = 0 Vw € V q , (W-Z)-(A£-b)= 0. 

Since A defines a metric, the first of these two conditions implies that, at each point q , 
W — Z is orthogonal, in this metric, to T> q . Hence all generators of a gauge momentum 
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have the same component along T>. This argument shows that, moreover, if Y is a 
generator of a gauge momentum, then its unique generator which is a section of T>, if 
it exists, is II aY, where, at each point q , 11^ is the A-orthogonal projector onto V q . 
Furthermore, HaY is a generator of the gauge momentum if and only if (II aY — Y) ■ 

(M~b) = 0 . 


a 

I 


In our case II aYk = 


a 2 z a I I az , a cos 9 sin (7 —ip) \ a 
' (a 2 +I)r 2 U ~1 \(a 2 +I)r ~ 1 ' I sin B 1 U V 

cos (7 — q>) do is not a generator of Kq since (JIaYk — Yk) • = 


a sin (7 — tp) 
I sin 6 
I ( a-\-r)Qz 
(a 2 +/)r * 


9-0 + 


Remark. We have considered the case of constant gravity, with potential energy V(z) = 
gz, in order to make a direct comparison with [4], that treats only this case. However, it is 
easy to verify that the two first integrals Fq and A'q exist, and retain their interpretations 
as (gauge) momenta, with the same generators, if the sphere is acted upon by any 
potential energy that depends only on z, that is, which is invariant under the considered 
action T. This is a ‘Noether-like’ property, called ‘weak-Noetherianity’ in [T5j. This 
example thus shows that, in this respect, the case of nonholonomic systems with affine 
constraints differs from that of nonholonomic systems with linear constraints, where the 
only gauge momenta with such a weakly-Noetherian property are those that have a 
generator which is a section of V m 


5. Conclusions 

At the core of Proposition 1 lies the balance equation of the energy 0. Together with 
the assumption of ideality of the constraints it implies that conservation of energy is 
equivalent to 

(R(y),£(v))= 0 VveM, (13) 

that is, to the fact that, in any constrained kinematical state, the reaction force R does 
not do any work on the nonhomogeneous term £ of the constraint. 

If the only knowledge assumed on the reaction forces is d’Alembert principle, namely 
that they annihilate the distribution T> , then all can be deduced from equation m is 
that there is conservation of energy when £ is a section of T >, that is, when the constraint 
is linear. This point of view is quite widespread: the very use of expressions such as 
“undetermined multipliers” or “unknown [reaction] forces” indicates a perception of the 
reaction forces as intrinsically unknown objects. 

The novelty of our approach is that we exploit the fact that, for a given system, the 
reaction forces are known functions of the kinematic states. This allows to identify the 
cases in which the energy is conserved even if the constraint is genuinely affine. 

Very similar considerations can be made concerning the conservation of momenta and 
gauge momenta. 

The general message we convey, besides the specific information about the conser¬ 
vation of energy and (gauge) momenta in nonholonomic mechanical systems with affine 
constraints, is that the reaction forces should be taken into a clearer account in theo¬ 
retical studies of these systems. Particularly when analyzing differences from holonomic 
systems (Noether theorem, conservation of energy, hamiltonianization, perhaps invariant 
measures) it might be important to exploit the presence of the reaction forces in the 
equations of motion. From this point of view, advancement in the comprehension of non¬ 
holonomic mechanical systems is not a purely geometric matter and the comprehension 
of some dynamical aspects might pass through a better understanding—and perhaps 
attempts of classification—of the reaction forces. 
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6. Appendix: The equations of motion and the reaction forces 

We derive here the expression of the reaction forces as functions of the kinematic states 
of the system for a nonholonomic system with affine constraints. 

Proposition 4. In the hypotheses and with the notation of section 2.1: 

1. For any vg £ M there exist unique curves M 9 t Kx (v t ,R t ) £ M x T>° such that 
vo = vo and 

[L\(vt)=Rt W. (14) 

2. There exists a function R £ ^ : M —x D° such that, for all vg, Rt = Ri M$t) Vf. 

3. In any system of bundle coordinates ( q , q) in TQ, if M is described as S(q)q + s(q) = 
0, then the representative Rl,m of R^ ^ is the restriction to M of the function 

EP, IB- 

Proof. We first work in coordinates and then globalize the result. The matrix A = 

a2t o2 t 

FqBq = dqdl nonsingular and independent of the velocities. 

Assume 1 H > ( q t ,q t ,Rt ) is the local representative of a curve in M x V° that satisfies 
(1141) . Then, for all t, 

S(q t )q t + s(q t ) = 0 (15) 

and R t £ rang eS(q t ) T . Thus, there exists a curve t M- A t £ (the Lagrange multiplier) 
such that R t = S(q t ) T Xt- Since [L\ = A(q)q + I(q, q), equation (flTl) is 

Mqt)qt + e(q t ,q t ) = S(q t ) T \ t 

and gives qt = A(q t )~ 1 [S(qt) T Xt — £(qt,qt)]- Inserting this expression into S(qt)qt + 
cr(qt, qt) = 0, that follows from (fl5l) . gives 

S(q t )A(q t )- 1 [S(q t ) T X t - f{q t , q t )\ + <r{q t , q t ) = 0. 

This equation can be solved for A* because the matrix SA~ 1 S T is invertible and gives 

Xt = [S(q t )A(q t )- 1 S(q t ) T r 1 [S'(g t )A( (?t )“ 1 £(g t , q t ) - a(q t , q t )] (16) 

or R t = R L ,M(qt,qt) with Rl,m as stated. Together with the independence of Rl,m of 
the choice of S and s, see the remark at the end of section 2.2, this proves that if Rj A y 
exists, then its local representative is Rl,m and hence the uniqueness of £ i—>- ( qt,qt,Rt ) £ 
M x P°, as in item 1. 

Consider now the equation 

Aq + i = S T {SA- 1 S T )- 1 {SA~ 1 e - a) 

in R n . Let t i—>• qt be its unique solution with initial datum (go, Qo) E M. Define t > Xt 
as in With this choice of A t , the curve t >-> q t satisfies Sq t + cr(q t , qt) = 0, or 

■i[S(qt)qt + s(< 7 t)] = 0. Hence ( qt,q t ) £ M for all t and satisfies equation (ThO) with 
R t = S(q t ) T X t £ T>° t . By d’Alembert principle, such an R t is the representative of a 
reaction force that the nonholonomic constraint can exert. 

This proves a local version of items 1. and 2., which hold true within each coordinate 
chart. 

We now globalize these results. To this end we verify that the local representatives of 
equation (ED. with reaction forces that have the expression (0 , match in the intersection 
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of different chart domains. Let q la q = C(q) be a change of coordinates in Q. The local 
representatives of the affine subbundle M in the new coordinates is still given by an 
equation of the form S(q)q + s(q) = 0 and a simple computation shows that 

S = P [S o C\ C , a = P [s o C] 

where C is the Jacobian matrix of C and P = P{q) is a k x k nonsingular matrix. The local 
representatives L of the Lagrangian in the new coordinates is L(q,q) = L(C(q),C'(q)q) 

4 —' T 

and the matrix A := ° f is given by 


I = C' T [A oC]C'. 


In order to prove the statement it suffices to show that, if t la q t is a solution of 

it q -fr sriSA '' sTr ' {SA ~ H - a) ' 

then t qt = C~ 1 (q t ) is a solution of 


~ - ^ = sFisA-'srr'iSA-'i-*) 

at dq oq 

with t and cr defined as £ and a in ©, but in terms of L, S and s. It is well known from 
Lagrangian mechanics that 


d dL dL lT / d dL dL\ ■ 

dt dq dq A dt dq dq) 

Elementary computations show that 


ii = (C ,T e) i + (C ,T A) ij [^C , Jq\ 
d a = (Pa) a + (PS) aj [d-C'^} 


where C" is the Hessian matrix of the j-th component C 3 of C and with the convention, 
used below as well, that £. A, cr and S are composed with C. Since = PSA~ 1 C'~ T , 

this implies SA~ 1 £ — a = P[SA~ 1 £ — a] so that 


^(sa-'s 7 ') 1 [sA~ 1 e-a] 


C 


(S t (SA~ 1 S t ) l [SA~ 1 £+a])oC 


This completes the proof. 


□ 


References 

[1] R. Abraham and J. E. Marsden, Foundations of Mechanics (Benjamin, Reading 1978). 

[2] C. Agostinelli, Nuova forma sintetica delle equazioni del moto di un sistema anolonomo ed 
esistenza di un integrale lineare nelle velocita. Boll. Un. Mat. Ital. 11 (1956), 1-9. 

[3] V.I. Arnold, Mathematical Methods of Classical Mechanics. Graduate Texts in Mathematics 
60 (Springer-Verlag, New York, 1989). 

[4] L. Bates, H. Graumann and C. MacDonnell, Examples of gauge conservation laws in 
nonholonomic systems. Rep. Math. Phys. 37 (1996), 295-308. 






Conservation of energy and momenta in affine nonholonomic systems 


15 


[5] L.M. Bates and J.M Nester, On D’Alembert’s principle. Commun. Math. 19 (2011), 57-72. 

[6] L. Bates and J. Sniatycki, Nonholonomic reduction. Rep. Math. Phys. 32 (1993), 99-115. 

[7] S. Benenti, A ‘user-friendly’ approach to the dynamical equations of non-holonomic systems. 
SIGMA Symmetry Integrability Geom. Methods Appl. 3 (2007), Paper 036, 33 pp. 

[8] A.M. Bloch, P.S. Krishnaprasad, J.E. Marsden and R.M. Murray, Nonholonomic mechan¬ 
ical systems with symmetry. Arch. Rational Mech. Anal. 136 (1996), 21-99. 

[9] A.V. Borisov, I.S. Mamaev and A.A. Kilin, The rolling motion of a ball on a surface. New 
integrals and hierarchy of dynamics. Regul. Chaotic Dyn. 7 (2002), 201-219. 

[10] A.V. Borisov and I.S. Mamaev, Conservation Laws, Hierarchy of Dynamics and Explicit 
Integration of Nonholonomic Systems. Regul. Chaotic Dyn. 13 (2008), 443-490. 

[11] F. Cantrijn, M. de Leon, D. Martin de Diego and J.C. Marrero, Reduction of nonholonomic 
mechanical systems with symmetries. Rep. Math. Phys. 42 (1998), 25-45. 

[12] J. Cortes Monforte, Geometric, Control and Numerical Aspects of Nonholonomic Systems. 
Lecture Notes in Math. 1793 (Springer-Verlag, Berlin, 2002). 

[13] M. Crampin and T. Mestdag, The Cartan form for constrained Lagrangian systems and the 
nonholonomic Noether theorem. Int. J. Geom. Methods. Mod. Phys. 8 (2011), 897-923. 

[14] R. Cushman, J.J. Duistermaat and J. Sniatycki, Geometry of Nonholonomically Con¬ 
strained Systems. Advanced Series in Nonlinear Dynamics 26 (World Scientific, Singapore, 
2010) 

[15] F. Fasso, A. Giacobbe and N. Sansonetto, Gauge conservation laws and the momentum 
equation in nonholonomics mechanics. Rep. Math. Phys. 62 (2008), 345-367. 

[16] F. Fasso, A. Giacobbe and N. Sansonetto, On the number of weakly Noetherian constants 
of motion of nonholonomic systems. J. Geom. Mech. 1 (2009), 389-416. 

[17] F. Fasso, A. Giacobbe and N. Sansonetto, Linear weakly Noetherian constants of motion 
are horizontal gauge momenta. J. Geom. Mech. 4, 2 (2012) 129-136. 

[18] F. Fasso, A. Ramos and N. Sansonetto, The reaction-annihilator distribution and the 
nonholonomic Noether theorem for lifted actions. Regul. Chaotic Dyn. 12 (2007), 449-458. 

[19] F. Fasso and N. Sansonetto, An elemental overview of the nonholonomic Noether theorem. 
Int. J. Geom. Methods Mod. Phys. 6 (2009), 1343-1355. 

[20] F. Fasso and N. Sansonetto, Conservation of ‘moving’ energy in nonholonomic systems 
with affine constraints and integrability of spheres on rotating surfaces. Preprint (2015). 
http://arxiv.org/abs/1503.06661 

[21] G. Gorni and G. Zampieri, Time reversibility and energy conservation for Lagrangian 
systems with nonlinear nonholonomic constraints. Reports on Mathematical Physics 45 
(2000), 217-227. 

[22] M. Jotz and T. Ratiu, Dirac structures, nonholonomic systems and reduction. Rep. Math. 
Phys. 69 (2012), 5-56. 

[23] M.H. Kobayashi and W.O. Oliva, A note on the conservation of energy and volume in the 
setting of nonholonomic mechanical systems. Qual. Theory Dyn. Syst. 5, 4 (2004) 247-259. 

[24] C.-M. Marie, Reduction of constrained mechanical systems and stability of relative equilib¬ 
ria. Comm. Math. Phys. 174, 2 (1995) 295-318. 

[25] C.-M. Marie, On symmetries and constants of motion in Hamiltonian systems with non¬ 
holonomic constraints. In Classical and Quantum Integrability (Warsaw, 2001), 223-242, 
Banach Center Publ. 59 (Polish Acad. Sci. Warsaw, 2003). 

[26] E. Massa and E. Pagani, Classical dynamics of nonholonomic systems: a geometric ap¬ 
proach. Ann. Inst. H. Poincare Phys. Theor. 55 (1991), 511-544. 



16 


Francesco Fasso and Nicola Sansonetto 


[27] J.I. Neimark and N.A. Fufaev, Dynamics of Nonholonomic Systems. Translations of 
Mathematical Monographs 33 (AMS, Providence, 1972). 

[28] L.A. Pars, A Treatise on Analytical Dynamics. (Heinemann, London 1965). 

[29] E.J. Routh, Treatise on the Dynamics of a System of Rigid Bodies (Advanced Part). 
(Dover, New York, 1955). 

[30] J. Sniatycki, Nonholonomic Noether theorem and reduction of symmetries. Rep. Math. 
Phys. 42 (1998), 5-23. 

[31] G. Terra and M.H. Kobayashi, On Classical Mechanics with Nonlinear Constraints. J. 
Geom. Phys. 49, 3-4 (2003) 385-417. 



